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(54) Method for processing a digital image 

(57) A method for processing a digital image using 
an imaging input device so as to reduce bleeding of the 
edges of the detected images comprising receiving a 
digitized image including at least one of a plurality of 
representations of objects, and processing the digitized 



image into at least one of a plurality of images of the 
digitized image. The processing step includes defining 
a plurality of bins having a plurality of edge points mak- 
ing up each of the plurality of images of the digitized 
image. 
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Description'- ■ i'' • ■■ " ' '^!'= -''^ 

[0001] This invention relates generally to detection of particular images of several documents placed on the platen 
of an image -input'device in the presenisb of severe backgrocind bleeding" and more particularly to the automatic de- 
tection and seghneritation of' the' series of irfiages placiBti bn the plateh usin^ the technologies of joining orphaned 
iiTiag'esJinear regression atgbrithms 'and col^ ' " 

[0002] ' To' ihhprove- the ease of use of scanning' s^^^ copiers and scanners, it is desirable to 

•autbrtiatically detect^and segment scan ned dbjects^f e g:; ^ dn'Xhe 'pf^ten of the input scanner 

To facilitate this' iSutohiaited detection- and' se^riientationj' it1s'hecessary''to ideritify the positjon, shape and rotation 
- angleof the object: Sijch'acapabllitf can 
enable multiple images to be-efficiently scam &d and ted uc for accurate'^resentation placed upon 

document feeding mechanisms. 

[0003]^ The- following dbcMents^descn US-A-5, 528,387, US-A- 

!4;922,350, -lJS-A-5;i253765r US-A-5,22^ Planar' Objects by Primitives." published in 1996 

IEEE Proceedings oflCPR '96, pp. 508-512 Voss eVal and*'MANAG I NG AfSlD REPRESENTING IMAGE WORKFLOW 
rKl' PREPRESS- Af^PtlCATlONSi^ Te^^^ (TAGA) Vol. 1, 1995 Proceedings pp. 

373-385. ■■■■ -■Oc.iv. ^- r;^. ^ c-.- -^r'' rr^ .: .r ■ 

10004]'^ In aceordahee'with t^^^^^ there'is prbvided a method for processing a digital image using an 

imaging-input device sbas to reduce' bl^edirig of the edges bf the detected images comprising receiving a digitized 
irViage including at least one of a plurality^ of repi^esehtations of oBjects; and processing said digitized image into at 
least one of a plurality of imagjesof said digiiizied imaige Wherein- said processing step' includes defining a plurality of 
• bins having arplui-ality of edge points making up ^ pi'urality of images of said digitized image. 

[0005] - The preserit invention is directed to a sy^fe'irl interidecl to autbmatically determine the boundaries of multiple 
objects within a scanned image of a series of documents, the present invention combines a number of graphics and 
image processing fechniqiics into an' autorriateld, system that 'provides productivity 'enhancement tor digital copiers 
and scanrnihg^systems; ■■^^'"^ ' ' - ^ . ^ ^ ^ 

[0006] The present ihvention accomjDlisfies these object i\)es by: ^1 ) locating each of the objects within an input image; 
2) modefing the shape bf the ideritified object (e.g., rectangtej^ and 3) positioning th object in a digital image at a 
predefined locations. ■ ' ^ ; ; ^ ^ ^' ■ 

[0007] Further, in accordance with the present invention there is provided a method for processing a digital image 
using an linagiri^ in jDut device reducing bleeding of 'the edges of the 'idetected images by determining edge contours 
to generate bounding boxes comprising: identifying bins having edges that approximate 90 degrees, and modeling a 
rectangular irriage fitting tor bins that appro)^imate the 90 desgrees 

[0008] Further, in accordarice with the present invention there is provided a method for processing a digital image 
comprising the steps 'of :- receiving a digitized image of a plurality bf representations of objects; and processing said 
digitized image into a plurality^ of images representing said digitized' irriage wherein said processing step Includes 
generating'a piura1ity-^of bins wherein said bins have a plurality of edge points making up the boundaries of each of 
said plurality of images of said digitized image. 

[0009J ' Final lyrin'^ccbrdance with the present ihvehtibn there is provided a method for processing a digital image 
using an imaging input device reducing bleedirig of the edges of the detected images by determining edge contours 
to generate bounding 'bo)^es comprising: identifying bins 'on 'said edge contours having edges that approximate 90 
^degrees,' and i^^^^ 

'[001 0] * TH'e'te'chhiques described herein enable a user to expediently scan a document or other object, automatically 
recbghiiiri^lhe shape bf the document within the digitized image, and composing a digitaMoc'ument incorporating the 
object. 

[OGi'l] the techniques described above are advantageous because they improve the efficiency of a scanning proc- 
■ essV'iri" the presence of severe bleeding. ' 

[0012] Sbrhe^^xarhples of methods according to the present invention will now be described with reference to the 
accorri pdny in g drawings, in which:- . ' 

Fi^ure't-is an' iHustration of an exemplary'digital color printing ^systern seWihg as an eml^odjrhent for the present 

inventiohV^'- ' ^ ~ ' ■ " ; ' ' yV -M.^.^ . -..y-.. 

' ■ ■ ^ Figui-'o 2--is k'-blbck diagram of1he vsi^ious components corjiprisr^^^ v"^^^'"^ 

•■•^^ ■■Figure^3ls ah illustration of the p lac 6 Went of a dbcument'objec^^ in Figure 1 ; 

''^ Figure!4 is a flowchart illustrating the geri^t^al prbcessi'ng steps carried out on jhe sysyem, of Figures 1 and 2 in 

accordance with the present invention; J ' ''!^-l~r^'- 

■' ■ Figure 5 is a detailed flowchart illustrating 'ai'sehes^o^^^^ steps carriedouHpJn.itially d^^^ the "seed" 

' oftheimages; ■ ' <,.: . - =0 ^.^^ .j.^..-.;* 
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Figure 6 is a more detailed flowchart showing the steps of determining the "seed". of the images; 

Figure 7 is a detailed flow chart illustrating series of processing steps carried out in accordance with Ihe present 

invention; . , . . . ..... 

Figure 8 is an illustration of line, estimation ja accordance with the presenUnvention; . , - 
5 Figure 9 is an illustration of,a bin categprizatioq in accordance with the present invention; • 

Figures 10A, 10B. and IOC are JlJustrations of- a bin removal operation in,accordance with the present invention; 
Figure 11 is an illustration of a midpoint averaging between bins in .accordance with the. present invention: 
_ Figure 1 2 is a detailed illustration of a step of the- flowchart. of Figure 6 in accordance with the present invention; 
Figure 1 3 is a detailed illustration.of a steg of theJowchart of Figure 6 in, accordance with the present invention; 
70 Figure 14 is ari illustration of an overlap detection of inriages in, accordance with thepresent.invention; and. 

Figure lS is a flowchart of the overlap detectiqq.^^ 

[001 3] For a general understanding.of the presenUnvention,. reference is made Jto the drawings; In the drawings, like 
reference numerals have been used throughout to designate identical ej^ements- In descn'bing the present invention, 
the following ternn(s) have. been used in the description. . ^ , " » : - - ' - . 

[0014] The term "data" refers herein to physicaLsignals that indicate or include:Jnforrnatlon. When an item of data 
. can indicate one of a number of possible alternatives, the item of data has one of a number, of "values." For example, 
a binary item of data, also referred to as a "bit," has one of two values,, interchangeably, referred to as T and "0" or 
"ON" and "OFF" or "high" and "Ipw." A bit is an "inverse.^ of- another bitjf the two bits.have different values. An N-bit 

20 item of data has one of 2N values. A "rnulti.-bil" itern of data is an item of.dala,lhat includes more than one bit. 

[0015] "Memory circuitry" or ."menriory" js any cirpuitryjhat can store data, and may include local and remote memory 
and input/output devices. Exarriples iriclude semiconductor ROf^s, RAry/ls,.and storage, medium access devices with 
data storage media that they^can access. A "memory cell" is ^memory circuitry that can store a single unit of data, such 
as a bit or other'n-array digit or an analog .value. . - . . ^ 

25 [0016] A signal "indicates" or "selects" pne of a sot.of alternatives if the signal causes the indicated one of the set 
of alternatives to occur. For example, a signal can indicate one bit set in a sequence, of bit sets to be used in an 
operation, in which case the signal causes, the indicated bit set to be used in. the operation, . ■. 
[001 7] An image may include characters,, words, and text as well as other features such as graphics. A text may be 
included in a set of one or more images, such as in images of the pages of a document., An image, may be processed 

30 so as to identify specific witl;iin the image, each of which is ap irriage. * . - n 

[001 8] A "bin" is a list of edge points that are approximately ^collinear. A list of edge points is.converled into a sequence 
of bins that represent the boundaries as straight lines. . --r- 
[0019] An object may be of any size and shape and has "physical attributes' or characteristics including, but not 
limited, to position, shape and orientation, For example, an. object may be a document, when the .document is placed 

3S on the platen of an image input device and the region of the platen, is digitized to produce a-representation thereof. 
[0020] An item of data "defines" an image when the item of data includes sufficient information to produce the image. 
For example, a two-dimensional array can define all or any part of an image, with each item of data in the array providing 
a value indicating the color of a respective location of the image. • , • • . : . - • : 
[0021] An item of data "defines" an image set when the item of data includes sufficient information to produce all the 

40 images in the set. . . ; . / . 

[0022] Each location in an image may be called a "pixel." In an array defining an irriage .in .which each item of data 
provides a value, each value indicating the color of a location may be called.a;"pixel value". -Each pixel value is a bit 
in a "binary form" of an image, a gray scale value in a "gray scale forrri" of an image. iOi;,.a set of colpr space coordinates 
in a "color coordinate form" of an image, the binary form, gray scale form, af;td cblqr coordinate, form .each being atwo- 

45 dimensional array defining an image. " 

[0023] An operation performs "image processing" when it operates on an iterri of data that relates to part of an image. 
[0024] Pixels are 'neighbors" or "neighboring" within an image .when there-ar,e.no other pixel^ between them and 
they meet an appropriate criterion for neighboring. If the pixels are rectangular and appear in rovys and columns within 
a two-dimensional image, each pixel may have 4 or 8 neighboring pixels, depending pn.the criterion used. 

so [0025] An "edge" occurs in an image when two neighboring pixels have sufficiently different pixel values according 
to an appropriate criterion fpcthe occurrence of an edge, between them. The terms Tedge. pixel" or "boundary pixel" 
may be applied to one or both of two neighboring pixels between which an edge occurs. . 

[0026] An "image characteristic" or "pharacleristic" isanneasurable attribute of ari image. An operation can "measure" 
a characteristic by producing data indicating the characteristic jusing data defining an image. "A characteristic is meas- 
ured "for an image" if the characteostic is.rneasured in a manner that is likely to produce approximately the same result 
each time it occurs. 

[0027] A "version" of a first image is a second Jmage. produced using an item of data defining the first image. The 
second image rinay be identical to the first irnage, or it may be modified by loss of resolution, by changing the data 
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' defining the first irHage;bV'by 6tHfe^^^ thaT resuife ih rinodifying pixel values of the first image. 

[0028] An "image input device" is a device that can receive an image and provide an item of data defining a version 
of the iniage. A ''scanhSr"'is'dn Inhage input device'that receives ah image by a scanning operation, such as by scanning 
a document/ A scanner 'may have a transpaYent suflace' (piaten) or e support a document during 

scanning. Other wbll-khbwn irriagei' input devices^'iric'lude digital baineras. facsimile- machi'nes, and video recorders, or 
any'image storage 'device' having the capability't'o sitore data si-ghals" representative of the intensity of light reflected 
'from the'SUKac'e of objebts at which the device is direcWd.; A general' p^ computer with an image creation program 
rhay also be an image input device. ■ ' .vo ■ ^-r; ; ; . . . -v^-/;/ 

[Q029] An "image^biitput device" is a device that cM receive a'nltem^ defining an image and provide or render 
the ttTia'ge-as"6utput:^'A "display" is an irri'age^butfiirt cfe^^^ the output image in human viewable form, 

' ahd a "printer" is airr rmagW odtpu^ device thait renders the' out put; i^ in a huiriah viewable, hard copy form. 
[0030] Referring nbwtb Figures t ahd 2, depicted therein is a digital print ihg^ 20 in which the present invention 
finds particular use. 

[0031] For exarrigle, system 20 rpay be a Xerox-, Doc u Col or® 40 Digital Color Printing System or the Xerox 5775® 
Digital Color Cbpier. System 2d'includes a computer or data processing unit 22 (Figure 2) capable of receiving digital 
data representing an image of an original document 24 placed upon a platen of scanner 26. Computer 22 initially stores 
"the digital input dat^ 'from scanner 26 in rTi^mbrv^'52 (e .'g.^F^AK^ or magnetic diS^ image may sub- 

^ sequently be accessed: Iri 'addition tb ihe digit^rciati^ merrrbry 52''may also include program memory for the storage 
of object code suitable for directing the processor to execute Image processing operations in accordance with the 
invention described hbirein. Cbrhputer 22 has a^sotiated therewith a user interface (U/l) 28 including one or more user 
input devices 30; stich as a kfeybbard, a keypad,- a mbuse trackball, stylus or equivalent pointing device, etc. 
[0032] Also pah of' sysfe'm 20 is a colbf irfiage output dbvice' Such as printer 34 which may include a laser-driven, 
xerographic printing engine astound in a number of commercially available printers. In a preferred embodiment, system 
20 is employed to process the digital image data received as input from a scanner 26, utilizing image processing 
software running in prpcxp.ssor .50, so.as, to produce an output file that may be rendered by printer 34, stored in memory 
50, and/or transmitted to another device via network 40. Although system 20 is depicted as an integrated unit in Figure 
1 , it will be appreciated that the system may also comprise a plurality of independent yet interconnected units. Referring 
tb Figure' 3, it will be appreciated that the document placed upon the scanner jolateh in system 20 may not be accurately 
registered along the re'gisti-ation edges 25. For example^ a particular' embodiment tb which the lollowing description 
will be birected' is a sirigle documerit object 6 placed in a central location on the platen of''scaririer 26 as illustrated in 
Figiii-e'S: In accordance with the prWseht invention it isdesired to autbmatically identify the position, shape and rotation 
angle of object 0. In a facsimile transmission system using aset of rollers to advance a document through a scanning 
nip, it wilf be appreciated that the document'may become skewed during the digitization process and that the present 
application rriay have particular application to such systerns as well. " . " 

[0033] Given an input Inhage generated by scanning the scanner platen, the present invention automatically identifies 
at least the position, shape and orientation ^^ngle of the object. As shown in the flowchart of f^igure 4, the process 
carried out by bomputer 22 during the processing of the thput indage Includes three' general steps. First, at step 100 
the object within the image is located and boundaries' of thebbject are generally'ldentified. 6nce the object is located, 
the shap'e' brthbobj^ctls'mb^ at step'2dO. Having* tocatbd the object and modeled its shape',' a structured image 
or siinilar digital dbbunn^h't format represehtintrthe image and the object therein can be created as represented by step 
'300: The-structuredifflage inc I udes^dataVep resenting not the image data itself, but data representing 

the fbbatibn.'-siiape or brientation of the object/ or sorrie bomb ihat ion thereof. Alternatively, the output may be a page 
descriptibh M'^Uag&'f brrnai or equii^alents formats suitabib for storing the image information^ in a retrievable form. 
'[OS34]''-'ln'an er^hbo^^ present invehtioh; 'the scanned' ihpu^ a lower resolution version thereof) 

is loaded into a merribr^ frame ■'b'uffer'{R'^^ it is anaiyzed lri accordance with the;previously described steps. 

For'purposes' bf tftb''f61lowing detailed' description, if is assumed that the object is distinguishable from the image 
■background (e'g:, the' platen cover) and that the background bf the im^ge is cbntiguou^. These simplifying assumptions 
■are ihlerided fbr;purpDses of eXpiancilion bnly and are nbl intended as lirhilations of the invention. One skilled in the 
art will appreciate that the inVehtion'described hereinIs extensible so as not to require operation bh ly within the bound- 
aries of such assuhnptions. ' ' ■ ' ; ■ " ' ■ ' ' - : v- . ■ • 
[0035]-" As'd#lcted''by the flowchart sfpf FVgii res Sarid 6; the object location step 100 is'pertdrmed by first identifying 
the backgrbuhd'regioh^of the input image "1 02, ch&racfei-izirig tHe'backglrburid regibnJ 04,' and the^ using the charac- 
teristic of the background region as a seed.Mdentifying ali the pixels i^bp.reieritihg the backgrbuhd region with an adaptive 
'dieed "fiil atgorithm 'l 06. Background 'pixels^'are pixbfs'not assbciatbd 'simply, they are pixels 
■Representative bt those regions l^ing outside 'bfmW'o'bj^^^^^ values of which are contrblled.by the "background" 
against which the o'tDjects are placed during scariiiing;|e^^^^^^ ihe underside of the platen cbver)V'^^^^ embodiment em- 
ploys the'average coior of a snhall regibh in%e' upgbr.i^^^^ corher of the scanned imagaa^ initial estimate of 
the background color. Alterhativeiy, other sampli'n'g'"bp^r'm^^^^ may be employed to' de^errri in e'the background color 
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as described, for example, in US-At5, 282,091 for a Programmable Apparatus for Detenpining Document Background 
Level by FarrelL _ . . . ^ ^ - 

[0036] Once the background.color is characterized at step, 104. an adaptive algorithm is preferably applied to monitor 
the background color and to^accuratejy identify the objects. An example of a seed-fill algorithm suitable for use in the 
5 present invention is described in Graphics Gems . I. A. Glassner Ed.. Acad'emic Press, pp.. 275-277. 1990, hereby 
incorporated by reference. An adaptive algorithm is required because the background pixels may have significant color 
variation resulting from^ a variation Jn illumination, oyer the platen area. The adaptive seed.fi M algorithm is applied to 
the scanned color image data using an initial seed point characterizeid by ^the.background: for example, the upper-left 
corner of the image. Generally, the adapt iye seedfilLalgprithm ms^ a..binary_ .'frame buffer with .a mask indicating all 
, 10 contiguous pixels ideniified as background pixels. Jna^.s^^^ represented by step 112, a pixel is consid- 

ered to be .a backgrqund pixel ft its color falls withiri.a small di?.tance e .of4he current average background pixel value. 
This distance is calculated as' ari Euclidean metric iri. red, green. b!ue,.(RGB) pplor space., 

where P^, AdAyg^ are. respectively, the RGB..cpmponegts of. the^pixel under test and the average background 
value, and d is the distance measurement. The value of,£ is fixed and ernpirically determined in one embodiment. The 
test conducted at step 1.1 2 is: , , . . > - ' - • 

^0 if d<e, then pixel P is a background pixel., else pixel; P is a foreground pixel. 

[0037] The average background color is adaptive ly nfiqd if ied at step 114 by taking the average value of the last N 
pixels that have been classified as background. For efficiency, the system preferably calculates the adaptive average 
using the equation: ' . 

25 

' " ^ AdAvg' := (N*AdAvg-AdAvg+LastVal)/N, '-- ^ • ■ - 

where AdAvg' is the rnodified average, AdAvg is the" previous adaptive average, LastVal is the value of the last pixel 
identified as background, and N is the averaging window Clearly, this is not a true running average, but it tracks the 
running average adequately and is more computationally efficient than a strict running average calculation. Alterna- 
tively, the value of e can be adaptively modified. For example, t might be. based on the standard.deviation of the last 
several pixels identified as background, etc. 

[P038] It will be appreciated that alternative methods for the detection of backgrpund regions may.be employed, and 
that the process of obtaining a binjary map distinguishing the objects from the background may be accomplished using 
a simplified thresholding operation based^upon the background color In a preferred embodiment, t^ie background color 
may be a function of the color of the platen cover, such as a white cover. As a further exarnple, the digital color production 
system may employ a spectrally reflective or.otherwise distinguishable platen cover that results in a background color 
that may be. detected and distinguished from the objects. 

[0039] Having identified background pixels and created a representation qf the/object/boun.daries, noisy edges can 
be optionally smoothed using,.for example, morphological filtering. Subsequently, contiguous foreground regions are 
located.'step 122, thereby identifying the, objects. Scanning the background mjask ge~nerated-by the adaptive seed fill 
operation (step 106) identifies objects. Starting with t^e upper left harid pixel, 1h^. mask is. searched in a scan line 
fashion for a pixel not classified in the mask as,a background pixel - thus identifYing pixejs associated with a foreground 
object. The use of the.seed fill algorithm for ideritifying.the background assures' that'for^ are closed, or 

in other words, a complete. boundary is formed about the perimeter of the. object", * V -/ . • = V .| - 
[0040] At step 124. tracing its edge' identifies the tDoijndary of an object. The . bpund'ary of the foreground object is 
traced using a simple 8-connected edge traversal opera to> which provides an prde red-set of points tracing the edge 
of the object. Such an edge traversal operation employs a contour, tracing operation to .generate a chain code in a 
manner similar to word or character "based recognition systems. R. Bo2inovic/et.al.Jn"pff-Line Cursive Script Word 
Recognition". IEEE Transactions describes an 8-connected process, for example,, prT/Pafte^ Machine 
Intelligence. Vol. 11 , No. .1 (January 1 989). In the situation „of detecting multiple objects, one rna in object would pref- 
erably be selected using any pf a number .of techniques. Fpr example, object selection, could be accomplished by 
retaining the object with the largest perimeter or based upon the location pf the object/.. . ^ 
[0)041] Once the object has been located, as described with respect to step 1 00, the oext general step, step 200. is 
to model the shape of the object. For purposes ot .simplicity, the folki>wing descriptiori/Will .Jreat rectangular-shaped 
objects. However, it will be appreciated that "the description of the invention is extensible to other polygons and even 
to shapes haying portions thereof represented by curves (e.g.. circular or elliptical objects). The result or output from 
step 100 is a set'of bounding pixels or a set of edge\traces preferably organized in the form of a linked list. These pixels 
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ortraces-c§h be used to extract the' object, but oHe'htati6n''ii^' 'noiVet determined. To improve the quality of the object 
extraction, the object traces'are flfted'to a mbdelsH?^^^^ 6'rieHtation information and other characteristics of the object 
^'may then* be^xtract^d^frorh' the f^^ 

[0'G42] • At step 400'"bf-'FigUre'7; the 'bifi' or result bt' step 100' i^e'.'^t'hVstefD of bounding pixels or set of edge traces in 
the form of a linked list Is defined. At step 4l0,'af ed(jctibri of the'bata-set is necessary. The linked list defined at step 
400 may be too lar'ge; 'ihfact; it may be in the^order of "1 ;dGG-data points or more' Therefore, it is necessary to reduce 
the amount of data of these~1 ",000 or so data points.' 'Using a 1^ the total data p'dints does such data 

reduction is performed. Typically the factor is around the value of 5, or may be another factor that enables a 100-200 
point's border. For example, if there were 1 ,000 data points, to reduce such a data set to within 100-200 points would 
require a factor between 5 and 10. the data set is then reduced to a set of 100-200 points. 

[0043] After the data reduction, at step 420 of Figure 6, an estimation of the angle of the line passing through each 
remaining point on the contour is prefornried. As shown in Figure 7, a modified linear regression in a particular window 
(W) centered on each point is perfor'med'so as to estimate an angle of the line passing through each remaining point 
of the contour determined by the set of data points. Initially a modified linear regression is done on a small window 
centered on a point (A) where each linear regression requires a series of additions, multiplication's, and arc tangent 
calculations. 

[0044] In particular, a standard regression algorithm fits data to the line y=mx+k. However, using a standard regres- 
sion algorithm to perform these functions can lead to erratic results as the line approaches a vertical. Therefore, in the 
present invention, the algorithm is modified so as to account for these erratic results. As shown in Figure 8 in each of 
Ihe four quadranls indicated therein, a standard regression is used for the lines that are more horizontal in two quadrants 
with a slope between -45°'andV45**.' WIne'n'the ^I6'pe is ribt in the identified degrees, that is, when the lines are more 
vertical with a slope greater than 45° and in the other two quadrants, an' inverted linear regression based on the inverted 
linear equations x-1/ny-k/m is performed. The slope angle is determined from the following equations: 

f avg (xy) - avg(x)avg(y) I > I avg(x^ ) - avg(x)^ I 

when the slope is more vertical 

I avg(xy) - avg(x)avg(y) I < I avg(x^ ) - avg(x^) I 
when the slopfe is more horil^ohtar '^" ■ ' ^ ' 

^ [0045] Once the slope calculation are'accomplished, at step 430 of Figure 6, each point associated with an angle Is 
categorized by perforrnihg a bln:categorlzing operation to generate a series of bins. For example, as shown In Figure 
9, bins B1 , B2, B3, and B4..: are generated from a series of angles, which are associated with each point. The object 
of ste'p 430 is; to'categorize groups of adjacent boundary points that share a common slope. le! convert the list of 
bbundary points Into a ^equeh^ of bins (BT, B2. B3...), where each of the bins cohsists of a set of collinear points so 
^ as to generate, a* bouhcfiiy^^ up of a set of straight line segments., \ . 

[004^] When generatihg the 'above-described line segniients, boundary point angles of the predecessor points may 
■ differ from the angle of Ihe current point by a substantial amournt. If, for example, the boundary point' angle differs from 
the previous pbiht^^ thkn a particular amount E, then the point Is added to the current bin. . If the boundary point 
angle differs E,'then it is assigned^o a new bin/Each bin (B1, B2, B3...) represents 
'a'sefbf coiltheaFpoihts^ ' ^ , 

' [d047]' ''bhce'the bfri categorization is; completed.and the eachp'bint is assigned with the appropriate bin some of the 
'bins cbntairi caitegorized contain to few data points. Therefore at step 440 of Figure 7' it 1^ necessary to remove the 
' bins which coritairi.feW or less than a minimum number of data points. Typiciaily this removal is performed when there 
are approximately 5 data pblnts. As an example of this removal in Figure 10, the smaller bins A, B, and C are removed 
because they contain less than 5 data points. * \ • • w'. 

" [6648] * Aft^rt'he 'bin removal of the shnaller bins,'';an average angle of points V it hiri each.ofjfie remaining bins is 
determined at step 450; The same modified regresslons.descrlbed above jare'hovv applied the 'remaining bins gen- 
erating ah average sibpe and ayerage slope angle calculation for each of the remaining bins.,. 
[6649] ' Mbxti'the 'remaining bihs that shciVe comhnbn yngles'are cbmbihedat;step 460.Tbese/i:ernaining bins were 
' previously 'not a<djacent to each othefr But "became ad jac found In between 

' the non-iaidjacent bihs. When thes'febths In betwein ai-e Vemo^^^ the bins that are left aindlhat are now adjacent can 
sometimes be collinear. Therefore, the angle of^each'bjri is checked "and If adjacent bjris7a re collinear, the bins are 
joined into a single biti at step 460. Such k^procedure^ 11 ,"■ were'a'bin ^ by AL-AR and 

another bin defined by BL-BR are compared tb'each' ot?ier.'The rnidpoint between each pbiWt'AM/BM is calculated and 
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the slope of the segment AB is determined and corripared to the slopes of each of the bins. If the slopes are within 
10°. an amount designated as^ E, then.the; following calculation js performed^ . ■ . > r . . . 
[0050] (Slope (AB) - Slope (A)) < E & (Slope(Ae)'- Slope(B)) < E where E is approximately 10 degrees. There are 
instances when the slope calculations, are susceptible to noise in short bins, hence the value is modified if one of the 
bins is determined to be very short with respect to the other bin. . - 

[0051] Finally, at step 470. the intersection jbet ween each bin and the next bin is calculated. The midpoint x, y, and 
slope angle M of each bid is used to calculate the. iotersection xy between the two bins. 

■• ■ kg = - m^x^ ' - ■■ ' - • . 



20 . S . , ' ' ' ' . * ' 

[0052] The angle of intersection is calculated from the vector cross product between vectors pointing from the inter- 
section point to the midpoints: _ ^ V • - 

2s A x B ^ [(x1 -x)(y2-y) - (x2-x)(y1 -y)] k 

' ' = [A] [B] sin (9) k 

50 e = sin'^ {(xl -x) (y2-y) - (x2-x) (y 1 -y) / 

[0053] At this stage of the procedure the edge boundary has been reduced to a small number of bins. The next step 
is to apply a recognition algorithm because the final number of bins usually is 4 for rectangular images. However, there 
are cases when there are more than 4 rectangular images and so it is necessary to apply a recognition algorithm. This 

35 results because smart platens sometimes have regions that are always easily distinguished from the background im- 
ages and the boundaries of the detected images bleed into the background images". Because of the bleeding, the 
detected boundaries do not match tl:ie actual images in certain locations. Therefore, the boundaries fail to fit the actual 
image and it is necessary to apply further procedures to make the boundaries fit particular images, 
[0054] At step 500 of Figure 1 2. the list of bins which is generated is browsedjo identi,fy,bins that have 90"* intersec- 

40 tions with neighboring bins or the bins that approximate 90° an gles'withJhVir , neighbors. If, there, are bins that are 
ascertained to be at 90° angles, then these bins are overlooked because ten the .b ins that have these angles are 
from images misaligned against the edges of the platen. Therefore,, if the biplis found'to b^ a right angle then the bin 
is more than likely generated from images' aligned at platen edge and according l^j should' be discarded. Moreover, it 
is necessary to search for all right or 90* angles because these angles also nnay!co^resp^^^ of an image. 

45 To be considered a right angle a bin has to lie between 85-95°.. As iridicated!ear!ienJn^ Figure 7 the angle 

; of irilerseclion is calculated and is used in step 500 to determine whether or not there are right angles. 

. * [0055] Next, as illustrated in Figure 12, at step 530. once the right angle is identified.' the flow prcx^eeds to determine 
■ whether a rectangle can be constructed and can be test fitted to the boundary. To construct a rectangle the following 
steps shown in the flow chart of Figure 13 are performed. 7 . 

so [0056] At step 700 of Figure 13, an estimation of the rotation angle of a rectangle as, the. slope angle of the right 
angle of the bin is performed. Next, at step 710 all the points on the boundary of the rectangle are rotated according 
to the rotation angle" found ip'step 700. At step 720, the axis aligned bounding b6x',!is. cajculated. Then the corner 
^ coordinates ohhe axis aligned bounding box areunrotatedatstep730. The flow th eh proceeds to step 540 of Figure 12. 
_ [0057] At' step 540 of Figure 1 2. a test of the similarity measure or similarity fit is performed by . c^ the number 

55 ^ * of boundary p^oints in each of the t^^o bin's that are at right angles and that lie within a measure of 3 pixels of the resulting 
. . , " .?P-.H^'^'n9 divided by the total number of points in the bin. If the similarity measure is less than'.9 the rectangle test 
is. considered to be a good match arid the flow proQeeds until done. If the similarity test is less than .9 no results are 
found the'flpw proceeds to step 500 for the next pTaJr of .bins at step 545 and is repeated for the next right angle parr 
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of bins. If no angles have sinnilarity thep the flow proceeds to step 550 to an after final search until no bins are left. 
[0058] Next, at step 560 the longest bin is searched and the flow proceeds to determine if a rectangle can be con- 
structed to be test fitted to the boundary. In order to construct the rectangle the following steps are necessary as shown 
in Figure 13. Initially! 'ah 'estimation of the-'rotation^r'angle'is 'deterrnined at step 700, then all the points are rotated at 
-Step 710; a calculatiohidf an axis aligned bou'ridihg-box at 'step 720'''and the corner coordinates of the axis-aligned 
:bounding box ar^e 'unrbtated. The flow then proceeds to -^istelD 580. There a rectangular test fit is performed and is 
determined to be good if the test fit similarity measure is greatelr thah\9 resulting in a good iyiatch to be determined. 
If the similarity me^sUre Is^less'lhan .9 the flow 'procee'ds^back to'step 560 a*nd is repeated aga'irilor the next longest 
■ bin.' If ^the test fit similafity test-fails- again -th'eh^the ^fep prbcfebds Yo the aft^r fim seS'rch with no bins left Step 600. 
The flow then proceeds to Step'^10 where'tW^ -ldh'^'st'bin 'i^ Then a''r6ctar^gl'e is dbnstructed at step 620 

from the longest bin. 

[0059] At this point, a bounding box has been Identified along with its rotation angle. The next step is to eliminate 
those boundaries that are determined not to actual boundaries of the bounding box. If the aspect ratio of the bounding 
rectangle Is more than 15:1 or less than 0.075:1 then a boundary is considered not to be conforming as a boundary 
■;of4he = bounding^ bbx-ahd- is is less than 5 pixels;' the boundary is again 
considered not to be desirable and Is rejected. ' ■'■ ' • ' ' ' . 

[0060] Finally, it is possible that a single Image may show up as two or more boundaries if the bleeding is excessive. 
In-thls case> a cbr^tainrrierit' algoi'ithm is ap'plikj-^ to tfest if a boundary is contain e within another. Each boundary Is 
"lested against all other bbUindai^ies'tb see--if'ther'e-is a 25% or more ' overlap 'between the two boundaries. If such an 
overlap occurs, -the two boundaries are jbiried and the* bounding rectangle is recalculated' (go to step 500). 
[0061] The points that make up a boundary are gen^railly 'tHe verticals of a many-sided polygon. The containment 
test as shown in Figure 14, Is a procedure to calculate the overlap between two polygons PI and P2. 
[0062] ■■1'nitlally, as showh^'in Figure ISv^'a-'deternriihatibn of'the rrilnimum and maximum Pi & P2 y-values (step 800) 
for each pblygon-is-'fierformed.' Ih*the-case b1 Smart 'Piateh7'these values are' already knowri and do not have to be 
recalculated. Next, for each polygon, there is an allocation of an array large enough to store two integers for all possible 
values of y in the polygon (step 810). For exarfiple, if y were to have a range of 332, then it Is necessary to allocate 
an array fbf "332 pointers to two initegers!' Third,' for each pblygdhV X vialues are determined by scanning through the 
vertices and determining the min and max values of X for each Y-value (step 820). However, vertices may not exist for 
each'y-value, therefbre Y-values may^have to be iriterpolated?As ah exarfiple. consider'twd vertices given by (x1,y1 ) 
and (x2,y2) where y2-y1-K: It- is necessary to 'interpolate K-1 values of X; one for each value of y. The algorithm for 
Interpolating x is as follows: 



[CX)63] Once the arrays containing the min and max x-values for each polygon is created at step 810, an overlap 
measurement determination is performed at step 840. 

[0064] The tesi for vertical overlap Is if the least y-value is greater than the largest y-value step 840, there is no 
overlap and then the flow proceeds to step 880. Also, if the largest y-value of P, is less than the least Y-value of P2, 
then there is no overlap, the flow proceeds to step 800. However, for those y-values where both PI and P2 overtap, 
the horizontal overlap Is summed and the following checks are performed. First, if the minimum x-value of PI is greater 
than the maximum x-value of P2 then the flow proceeds to step 860. Second, if the maximum x-value of PI Is less 
than the minimum x-value of P2, then the flow proceeds to step 870 and an estimation of the overtap for that y-value 
is performed as follows: 



(Step 830) 



dx = (x2-xl) / 
for (j=0; j<K; 
{ 



K; 



+ + j) 



X = xl + j*dx; 



Overlap + = Maximum Plj^^j^, P2 ; ) - Minimum (PI 



Xmax 



, P2 
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^ 'xa4x) + '' (step 870)-^ ^ ■ 

[0065] This determination shown above-returns the overlap between two polygons. When applied to the inventions 
5 platen, we normalize the returned overlap area by dividing by the smaller of. ihe.two borders. If the-result exceeds 25%, 
the srrialler boundary is. considered to be an orphan opi a part- of the larger boundary and the two,are merged and a 
new bounding rectangle is calculated,. ; • ...... r > . 

[0066] By performing the above, steps a gocd ^bounding. rectangle.' is. identifjed. Further techniques such as file ex- 
traction are also applicable to further process the received data. Qther kinds of.processing steps may also be performed 
?o for detecting non-rectangular objects, overlapping rectangular objects, etc. ^ 

Claims . . - • . ■ * -r i ■ ... . -. • ■ j-i-^. . ^ ' - . 

J5 1. A rnethod tor processing a digital irnage using an-imaging input deyice so.as to reduce bleeding of the edges of 
the detected images comprising the steps of : , .r . ^ - ^-^r .-■ ..^ • . ~ 

receiving a digitized imagejncluding at least one of a.pluraljty.of representations of objects; and 
processing said digitized irriage into at-least orie of ^a plurality of images. of said digitized image wherein said 
20 processing step includes defining a plurality of. bins having a. pJurality of^edge pojnls making up each of said 

plurality of images of said digitized, image. .^ ... - 

2. A method for processing a digital inrage. using aui-imaging mput device-reducing bleeding of. the edges of the 
detected images by determining edge contours to generate bounding boxes.comprising: 

identifying bins having edges that approximate 90 degrees, and - . 

modeling a rectangular image fitting for bins that approximate the 90.degrees:edges., 

3. A method, for processing a digital iniage using an imaging input device reducing, bleeding. of the edges of the 
30 detected images by determining edge contours to generate bounding boxes comprising: 

identifying bins on said edge contours having edges that approximate 90 degrees, and 
modeling a rectangular image for each the identified bins. 

35 
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